pubtator 3
AI-assisted Knowledge Discovery in Biomedical Literature to Support Decision-making in Precision Oncology
He, Ting, Kreimeyer, Kory, Najjar, Mimi, Spiker, Jonathan, Fatteh, Maria, Anagnostou, Valsamo, Botsis, Taxiarchis
The delivery of appropriate targeted therapies to cancer patients requires the complete analysis of the molecular profiling of tumors and the patient's clinical characteristics in the context of existing knowledge and recent findings described in biomedical literature and several other sources. We evaluated the potential contributions of specific natural language processing solutions to support knowledge discovery from biomedical literature. Two models from the Bidirectional Encoder Representations from Transformers (BERT) family, two Large Language Models, and PubTator 3.0 were tested for their ability to support the named entity recognition (NER) and the relation extraction (RE) tasks. PubTator 3.0 and the BioBERT model performed best in the NER task (best F1-score equal to 0.93 and 0.89, respectively), while BioBERT outperformed all other solutions in the RE task (best F1-score 0.79) and a specific use case it was applied to by recognizing nearly all entity mentions and most of the relations.
- North America > United States > Maryland > Baltimore (0.05)
- Europe (0.04)
- Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
PubTator 3.0: an AI-powered Literature Resource for Unlocking Biomedical Knowledge
Wei, Chih-Hsuan, Allot, Alexis, Lai, Po-Ting, Leaman, Robert, Tian, Shubo, Luo, Ling, Jin, Qiao, Wang, Zhizheng, Chen, Qingyu, Lu, Zhiyong
PubTator 3.0 (https://www.ncbi.nlm.nih.gov/research/pubtator3/) is a biomedical literature resource using state-of-the-art AI techniques to offer semantic and relation searches for key concepts like proteins, genetic variants, diseases, and chemicals. It currently provides over one billion entity and relation annotations across approximately 36 million PubMed abstracts and 6 million full-text articles from the PMC open access subset, updated weekly. PubTator 3.0's online interface and API utilize these precomputed entity relations and synonyms to provide advanced search capabilities and enable large-scale analyses, streamlining many complex information needs. We showcase the retrieval quality of PubTator 3.0 using a series of entity pair queries, demonstrating that PubTator 3.0 retrieves a greater number of articles than either PubMed or Google Scholar, with higher precision in the top 20 results. We further show that integrating ChatGPT (GPT-4) with PubTator APIs dramatically improves the factuality and verifiability of its responses. In summary, PubTator 3.0 offers a comprehensive set of features and tools that allow researchers to navigate the ever-expanding wealth of biomedical literature, expediting research and unlocking valuable insights for scientific discovery.
- North America > Canada > Quebec > Montreal (0.14)
- Asia > China > Liaoning Province > Dalian (0.04)
- North America > United States > Maryland > Montgomery County > Bethesda (0.04)
- Research Report > New Finding (0.93)
- Research Report > Experimental Study (0.93)